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This paper considers the problem of multi-sample nonparametric 
comparison of counting processes with panel count data, which arise 
naturally when recurrent events are considered. Such data frequently 
occur in medical follow-up studies and reliability experiments, for 
example. For the problem considered, we construct two new classes 
of nonparametric test statistics based on the accumulated weighted 
differences between the rates of increase of the estimated mean func- 
tions of the counting processes over observation times, wherein the 
nonparametric maximum likelihood approach is used to estimate 
the mean function instead of the nonparametric maximum pseudo- 
likelihood. The asymptotic distributions of the proposed statistics 
are derived and their finite-sample properties are examined through 
Monte Carlo simulations. The simulation results show that the pro- 
posed methods work quite well and are more powerful than the ex- 
isting test procedures. Two real data sets are analyzed and presented 
as illustrative examples. 

1. Introduction. Consider a study that concerns some recurrent event, 
and suppose that each subject in the study gives rise to a counting process 
N(t), denoting the total number of occurrences of the event of interest up 
to time t. Also suppose that for each subject, observations include only the 
values of N{t) at discrete observation times or the numbers of occurrences of 
the event between the observation times. Such data are usually referred to 
as panel count data [Sun and Kalbfleisch (1995), Wellner and Zhang (2000)]. 
Our focus here will be on the situation when such a study involves A; (> 2) 
groups. Let Ai{t) denote the mean function of N{t) corresponding to the /th 
group for I = 1, . . . ,k. The problem of interest is then to test the hypothesis 
Ho:Ai{t) = --- = Ak{t). 
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A number of authors have discussed the analysis of recurrent event data 
when each subject in the study is observed continuously over an interval or 
when the exact times of occurrences of the recurrent event are known. For 
example, the book by Andersen et al. (1993) presents many of the commonly 
used statistical methods for the analysis of recurrent event data. In contrast, 
there exists limited research on the analysis of panel count data. Sun and 
Kalbfleisch (1995) and Wellner and Zhang (2000) studied estimation of the 
mean function of N{t). Sun and Wei (2000) and Zhang (2002) discussed re- 
gression analysis for such data. To test the hypothesis Hq, Thall and Lachin 
(1988) suggested transforming the problem to a multivariate comparison 
problem and then applying a multivariate Wilcoxon-type rank test. Sun 
and Fang (2003) proposed a nonparametric procedure for this problem, but 
their procedure depends on the assumption that treatment indicators can 
be regarded as independent and identically distributed random variables, 
which may not be the case in practice. Park, Sun and Zhao (2007) proposed 
a class of nonparametric tests for the two-sample comparison based on the 
istonic regression estimator of the mean function of counting process. Zhang 
(2006) also presented nonparametric tests for the problem based on the non- 
parametric maximum pseudo- likelihood estimator, which is equivalent to the 
istonic regression estimator [Wellner and Zhang (2000)]. Also, Wellner and 
Zhang (2000) showed through Monte Carlo simulations that the nonpara- 
metric maximum likelihood estimator (NPMLE) of the mean function is 
more efficient than the nonparametric maximum pseudo-likelihood estima- 
tor (NPMPLE). However, no nonparametric tests have been discussed in 
the literature for panel count data based on the NPMLE, since the NPMLE 
is more complicated both theoretically and computationally. It is, there- 
fore, particularly important to develop nonparametric tests based on the 
NPMLE for panel count data. One would naturally expect the tests based 
on the NPMLE to be more powerful than the tests based on the NPMPLE. 
However, unlike the isotonic regression estimate, the maximum likelihood 
estimate has no closed-form expression and its computation requires an it- 
erative convex minorant algorithm. In this paper, we propose some nonpara- 
metric tests based on the maximum likelihood estimator and then compare 
them with the existing tests for the problem of multi-sample nonparametric 
comparison of counting processes with panel count data. 

The rest of the paper is organized as follows. Section 2 discusses estimation 
of the mean function and the existing nonparametric tests for the hypothe- 
sis Hq when only panel count data are available. The asymptotic normality 
of the functional of the NPMLE is established, while its proof is presented 
in Section 6. Section 3 presents two classes of nonparametric test statistics. 
The statistics, motivated by the property of the NPMLE and the idea used 
in survival analysis, are formulated as the integrated weighted difference be- 
tween the rates of increase of the estimated mean functions corresponding to 
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the pooled data and each group or two groups. The asymptotic normahty of 
these test statistics is also established, while proofs are given in Section 6. In 
Section 4, finite-sample properties of the proposed test statistics are exam- 
ined through Monte Carlo simulations. In Section 5, we apply the proposed 
methods to two data from a floating gallstones study and a bladder tumor 
study, respectively. 

2. Nonparametric maximum likelihood estimation of mean function. Well- 
ner and Zhang (2000) studied two estimators of the mean of a counting pro- 
cess with panel count data: the nonparametric maximum pseudo-likelihood 
estimator and the nonparametric maximum likelihood estimator. To de- 
scribe the test statistics, we introduce first the NPMLE. Suppose that = 
{N(t) :t >0} is a nonhomogeneous Poisson process with the mean func- 
tion E(N{t)) = A(){t). Suppose that K is an integer-valued random variable 
and T = {Tkj,j = 1, . . . , fc, = 1, 2, . . .} is a random triangular array, where 
Tkj-i < Tkj and T^fi = 0, for j = 1, . . . ,k and k = 1,2, . . . . We assume that 
{{K;Tk,i, ■ ■ ■ ,Tk,k)} are independent of N. Let X = {K,Tk, Nk), where 
Tfc is the fcth row of the triangular array T and N/^ = (A^(Tfc^i), . . . , A^(T/; fc)). 
Then, Xi = (Ki,TK.,Ni^Ki), i = I, ■ ■ ■ ,n is a random sample of size n from 
the distribution of X. Let X = {Xi, . . . ,Xn)- Then, the log-likelihood func- 
tion for the mean function A is 

n Ki 

/„,(A|x) = ^^(iv,(TK,.,) -iv.(r^„,-i))iog(A(r^,,,) - A(rK,j-i)) 
i=ij=i 

i=l 

after omitting the parts independent of A. 

Let ti < ■ ■ ■ < tm denote the ordered distinct observation time points in 
the set of all observation time points {T^-. j,j = l,...,Ki,i = l,...,n}. Then 
the NPMLE of Aq, A„, is defined to be the nondecreasing, nonnegative step 
function with possible jumps only occurring at t£,i = 1, . . . ,m that maxi- 
mizes /r!,(A|X). Wellner and Zhang (2000) gave the characteristic and the 
algorithm for computing this estimator, and studied its asymptotic proper- 
ties. 

Next, we need some more notation, some of which was introduced by 
Schick and Yu (2000) and Wellner and Zhang (2000). Let B denote the 
collection of Borel sets in TZ, and let ^[o,t] = {B H [0,t]: B e B}. Define 
measures ni, ^2, fJ^s and v on ([0, r], i3[o^T-]) by 

oo k 

fii{B) = Y,PiK = k)Y^ P{Tkj eB\K = k), 
k=i j=i 
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k=i j=i 



HsiBi X B2XB3 



fc-i 



J2 P{K = k)Y, P{Tk,j-i G Bi,Tk,j G B^^T^^j+i G Bi\K = k) 



k=2 



and 



'{Bi xB2) = J2 = ^)PiTk,k-i G Bi,Tk,k GB2\K = k) 



k=l 

for S,Si,S2,S3G%,]. 

The existing nonparametric tests [Park, Sun and Zhao (2007), Zhang 
(2006)] are based on the asymptotic normahty of a smooth functional of the 
nonparametric maximum pseudo-hkehhood estimator (the istonic regression 
estimator) A„, 

- K 

Wit){Anit)-Aoit)}df,iit) = P 



-i=i 

where W{t) is a weight function and P is the probabihty measure of X, 
Pf = J f dP. However, it is unknown if the asymptotic normahty of the func- 
tional of the nonparametric maximum likelihood estimator Jq W{t){An{t) — 
Ao{t)} dfii{t) still holds. We observe a key to the proof of such asymptotic 
normality is to use an important characteristic of the A„ given by 



n Ki 



(2.1) 



Y,Y.^{AniTK,M^n{TK,,j)-N,{TK,^,))=Q 

i=l j=l 

for any real function Lp. However, from (2.13) of Wellner and Zhang (2000), 
the corresponding characteristic of the NPMLE can be written as 



n [Ki-l 



E 



(2.2) 



where 



and 



E ^n{TK„j] 



i=l L j=l 



AN,{Tk,j+i) AiV,(TK„j) 

AAn{TK^,j+l) AAniTK^j) 



+ An{TK,,K,)\ 1 , , 

I AAn{TK„K,) 

AA{TK,j) = A{TK,j) - A{Tkj-i) 
AN{Tkj) = N{Tkj) - N{Tk,j-i). 



0, 
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Equation (2.2) can be extended to the form 

V IfiAJTK- ,■) An TV ,)<^ i^^J + ^l n 



n \K,-1 

E 



i=ilj=i .AA„(r^,,,+i) AAJT;,^,,) 

(2.3) 

+ 9.(A„(r^,,^j)A„(r^.,^j|i-^^^ii5^| =0, 

I AA„(rK„i^JJJ 

which will be shown in Lemma 1. Clearly, the structure of (2.3) is different 
from that of (2.1) and is much more complicated. This is why the derivation 
of the asymptotic property of /J" VF(t){A„(t) — Ao(t)} d^i(t) has not been 
done yet. So, we need to develop a new form of the test statistic when 
the NPMLE is used to estimate the mean function of counting process with 
panel count data. Motivated by such characteristic of the NPMLE, we define 

It is easy to see that Pf\{X) can be expressed as 

For establishing asymptotic results on Pf^ (X), we need the following 
regularity conditions: 

A. There exists a constant Kq such that P{K < Kq} = 1 and that the 
random variables T^j's take values in a bounded set [To,r], where To,r € 
(0,oo). 

B. The mean function Aq is strictly increasing such that Ao(ro) > and 
Ao(t) < M for some constant M £ (0,oo). 

C. There exists a constant Lq such that 

pf min {AoiTK,j)-AoiTK,j-i))>Lo] = l. 

D. E'je'^^^*^} is uniformly bounded for t G [0,t] and some constant c. 

E. /ii({To}) > and for all tq < n < r2 < r, Hi{{ti,T2)) > 0. 

Condition C holds if Aq is differentiable, Aq has a positive lower bound 
in [To,r] and P{mmi<j<KiTK,j — Tkj-i) > sq} = 1 for some fixed time sq, 
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where sq can be considered as the smahest length of consecutive observation 
times. Condition E holds if P{Tk,i = tq} > and iJ,'i{t) > for t G (tq, r). 

Now, let Aq ^ denote the inverse function of Aq, and let Wo Aq ^ denote 
composition of two functions W and Aq^. Zhang (2006) established the 
asymptotic normality of W{t){An{t) — Ao(t)} d^i(t), when WoA^^ is not 
only bounded Lipschitz but also monotone. However, the assumption that 
W o Aq ^ is monotone is not required for the tests with interval-censored 
data as a special case of panel count data [see Huang and Wellner (1995) 
and Zhang, Liu and Zhan (2001)]. Here, we do not need this monotonicity 
condition for W o Aq ^. 

Theorem 2.1. Suppose that Conditions A, B, C, D and E hold. Further, 
suppose that W{t) is a bounded weight process such that WoAq^ is a bounded 
Lipschitz function. Then, as n —> oo, 



(2.5) 



in distribution, where Uw has a normal distribution with mean zero and 
variance a'?„ with 



E 



(2.6) 



K-l 



AN{T, 



K,j) 



+ W{Tk,k)Ao{Tk,k)\i 



AN{Tk,k) 
AAo{Tk,k) 



which can be consistently estimated by 



1 



n 



(2.7) 



E 



1=1 ^AAn[lK„j+l) AAn{lKi,j) 



3. Nonparametric tests. Consider a longitudinal study that is concerned 
with some recurrent event and involves n independent subjects, n/ in the 
/th group with ni + • • • + = n and k>2. Let Ni{t) denote the counting 
process arising from subject i and A;(t) {I = 1, ... ,k) he as defined before, 
for i = 1, . . . ,n. Suppose that each subject is observed only at discrete time 
points < Txi,! < ■ ■ ■ < Txi^Ki and that no information is available about 
Ni{t) between observation times; that is, only panel count data are available. 
For simplicity, assume that Hq is true and let Aq (t) denote the common mean 
function of the Ni{tys. 
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Let Km denote the nonparametric maximum likelihood estimate of A; 
based on samples from all the subjects in the /th group, and A„ based on 
the pooled data. To test the hypothesis Hq^ motivated by our asymptotic 
results in Section 2 and an idea commonly used in survival analysis [e.g., 
Andersen et al. (1993), Pepe and Fleming (1989), Petroni and Wolfe (1994), 
Cook, Lawless and Nadeau (1996), Zhang, Liu and Zhan (2001), Park, Sun 
and Zhao (2007), Zhang (2002, 2006)], we propose the statistics 

1 n VKi-l 

v'^ i=iVj=i 



(3.1) 



rAA„,(rx„,-+i) AA„,,(rx,j) 



I AA„(Tk,,j+i) AA„(rA',, 

W^\TK,,KyK{TK,,K.^ 



AA„ (Tx^i^J 



(for I = 1, . . . ,k) and 



1 " 



■ i=i 



(3.2) 



/ AA„,(rx„,+i) AKATk.j] 
V AAniTK,,j+i) AA„,(Tx,j) 

/AA„,(rK,,,+i) AA„,(r^,,, 



V AA„(r^,,,+i) 



_ AA^JTV^ 



(for / = 2, . . . , A;), where wi'^(t)'s are bounded weight processes. The statis- 
tic Un^ is the integrated weighted difference between the rates of increase of 
A„ and over the observation times and the statistic Vn^ has a similar 
meaning. For the selection of the weight process Wn^ (t) , a simple and natu- 
ral choice is wji''''\t) = 1,1 = 1, . . . ,k. Another natural choice is Wn^'^\t) = 
Y^n{t) = ^27=1 -^(^ ^ TKi,Ki)/n, I = 1, . . . ,k, in which case weights are propor- 
tional to the number of subjects under observation. Based on groups, one 
may choose the weight process as 

W(''^\t)=Ym{t) or ^ or 



Yn{t) 



Yn{t) 
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where (t) {I = 1, . . . ,k) are defined as Yn{t), with the summation being 
only over subjects in the Ith group. Some weight processes similar to Wn 
have been used when recurrent event data are observed [see Andersen et al. 
(1993)]. In addition, J27=i^i^ > TKi,Ki)/n is also chosen as another weight 
process by Zhang (2006). Some other possible choices are 

i-Y^,{t) (i-y„,(t))(i-y.,(t)) 

Now, we state the asymptotic distribution of U„ = {Un \ ■ ■ ■ , Un^)'^ and 



Theorem 3.1. Suppose that Conditions A, B, C, D and E hold. Further, 
suppose that Wn'^ (t) 's are bounded weight processes and that there exists a 
bounded function W{t), such that W o Aq ^ is a bounded Lipschitz function 
and 



(3.3) 



{WP{t)-Wit)fdf,iit) 



nl/2 



Also, suppose that ni/n pi as oo, where < pi < 1, I = 1, . . . ,k and 
Pi + h Pfc = 1 . Then, under Hq : Ai = • • • = A^ = Aq 

(i) JJn has an asymptotic normal distribution with mean vector and 
covariance matrix 



(3.4) 

where 



Su„=rdiag((7?,f7i,...,a^)r 



V 



P2- \ — 
P2 



Pk 
"Pk 



^' Pk J 



and af 



• • = cj| = cr^ given in (2.6). 
(ii) V„ has an asymptotic normal distribution with mean vector and 
covariance matrix 



(3.5) 



Sv„=Hdiag(a?,ai,...,a^)H', 
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where 



H 



V 

and af is as given in (i). 
(iii) In addition, if 



1 

Pi 
1 

Pi 



— W— 



— J — 






1 

Pk ) 



(3.6) 



max E 

l<i<n 



Ki 



for I = 1, . . . ,k, then Su„ and Sv„ can be consistently estimated by 



(3.7) 

and 
(3.8) 

where 



= r„ diag((7?, <t|, . . . , a^)r; 
^Vu =H„diag((7i,<T|,...,<Tfc)H: 



2\TJ' 

ni 



n 



n 

nT 
n 

n ' rw 

n2 




n 
n 







and 



0"; 



n " 



E 



m 
w 

ni 
VK,-1 







\ 


n 
nk J 



E WP{TK,,,)AniTK,,j) 



1=1 L i=l 



(3.9) 



f AiV,(T^^,,+i) AiV.(T^,j) 
lAA„(rK,j+i) AA„(rK.jO 
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+ wil\T^^,^yK{TK.,K.){l - ^^r^ 

L AAn[lKi,Ki 



for 1 = 1,..., k. 



Let Uq denote the first (k — 1) components of U„ and Sq the matrix 
obtained by deleting the last row and column of '^u„ - Then, using Theorem 
3.1, two tests can be carried out for testing Hq by means of the statistic 

Xo = Uq XIq Uq and V^Xly^ V„, which have asymptotically a central x^- 
distribution with (k — 1) degrees of freedom. This can be seen readily from 
the proof of the theorem. 

Remark 1. If the weight process Wn^ is symmetric about Xi, . . . ,Xn, 
then (3.6) is equivalent to 



E 



0. 



Remark 2. For selection of weight processes, Zhang (2006) required 
that Wnit), W{t) and VFoAq^ are monotone. These monotonicity assump- 
tions restrict availability of weight processes. For example, the weight pro- 

^^^^ Y"^(t)+? ^(f) often used in survival analysis, but it is not monotone. In 
addition, the monotonicity assumption on the weight process is not appro- 
priate for deriving optimal tests under alternatives. In the above theorem, 
we have removed these assumptions. Therefore, compared to those stated 
in Zhang (2006), more weight processes are available here. It can be easily 
shown that the weight processes mentioned earlier all satisfy the conditions 
required by the theorem. 

4. Simulation study. To examine the finite-sample properties of the pro- 
posed test statistics and compare them with those of the tests presented 
by Sun and Fang (2003), Park, Sun and Zhao (2007) and Zhang (2006), we 
carry out a simulation study for the two-sample comparison problem. When 

k = 2, the null hypothesis can be tested by Ti = UrP /au and T2 = /ay , 
which have asymptotic standard normal distribution, where 



^ -2 ^ -2 

ay = \ — ^ 

.ni 712 



1/2 



and C/l^\ Vii^ and &[ are as given in (3.1), (3.2) and (3.9), respectively. 
Let TsF, Tpsz and Ti denote the tests presented by Sun and Fang (2003), 
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Park, Sun and Zhao (2007) and Zhang (2006), respectively. Here, we focus 
on evaluating the performance of Ti and T2 and comparing them to those of 
^PSZ; Tz and Tgp- Note that Tz = Tpsz for k = 2. To generate panel count 
data {ki,tij,nij,j = 1, . . . , /cj, i = 1, . . . , n}, we mimic medical follow-up stud- 
ies such as the examples discussed in the next section. We first generate the 
number of observation times ki from the uniform distribution U{1, . . . , 10}, 
and then, given ki, we generate observation times tj^'s from C/{1, . . . , 10}, 
for simplicity. To generate njj's, we assume that iVj's are nonhomogeneous 
Poisson or mixed Poisson processes. In particular, let {vi,i = l,...,n} be 
independent and identically distributed random variables, and given i/j, let 
Ni{t) be a Poisson process with mean function Ki{t\vi) = E[Ni{t)\i'i). Let Si 
denote the set of indices for subjects in group /, Z = 1,2. For the objective 
of the study, we consider two cases as follows: 

Case 1. = ^it for i G Si, Ai{t) = iyitexp{l3) for i G 52; 

Case 2. Ai{t\i^i) = Uit for i G Si, Ai{t) = i^i^/pi for i £ 6*2. 

Figures 1 and 2 display the graphs of the true mean functions for two 
cases with = 1 and different values of /?. It can be seen that the two mean 
functions do not overlap in Case 1 and they cross over in Case 2. 
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For each case, we consider Vi = l and Vi ~ Gamma(2, 1/2), corresponding 
to Poisson and mixed Poisson processes, respectively. For each setting, we 
consider two sample sizes, ni = n2 = 50 and 100, respectively. As mentioned 
earlier in Section 3, we choose the four weight processes 




The NPMLEs A„ and A„; are computed by using the modified iterative 
convex minorant algorithm (MICM) [see Wellner and Zhang (2000)]. All the 
results reported here are based on 1000 Monte Carlo replications using R 
software. 

Tables 1-4 present the estimated sizes and powers of the proposed test 
statistics Ti and T2 and those of the test statistics Tpsz , Tz and Tsf [Park, 
Sun and Zhao (2007), Zhang (2006), Sun and Fang (2003)] at significance 
level a = 0.05 for different values of /3 and the four weight processes based 
on the simulated data for the two cases with Vi = l and Vi ~ Gamma(2, 1/2), 
respectively. When Vi = 1, the Ni{t)'s are Poisson processes; when ~ 
Gamma(2, 1/2), the Ni{t)^s are mixed Poisson processes. The first part of 
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the table is for the situation with the total sample size of 100, and the sec- 
ond part is for the situation with the total sample size of 200. For Case 
1 considered here, the proposed tests display good power properties and 
the powers are close for the four weight processes. As expected, the power 
increases when the sample size increases, and the power decreases in the 
presence of more variability. As seen in Tables 1 and 2, the proposed tests 
with Wn^^ (t) have the best power performance, and the proposed tests based 
on the NPMLE are more powerful than the tests based on NPMPLE when 
more variability exists, as one would expect. For Case 2 considered here, the 
proposed tests also display good power properties, but the powers rely on 
choices of weight processes. As seen in Tables 3 and 4, the proposed tests 
with wji^^ have the best power performance, and the proposed tests with 
appropriate weights based on NPMLE are much more powerful and more ro- 
bust than those based on NPMPLE in this case. For example, when /5 = 5, 8 
for mixed Poisson processes, the new tests with Wn"^^ have good powers, 
but the tests Tpsz and Tz [Park, Sun and Zhao (2007), Zhang (2006)] with 
four weights and Tsf [Sun and Fang (2003)] have very poor powers. For all 
situations considered here, the performance of Ti and T2 are the same. 

Note that the tests with different weights have different powers for Case 2. 
Let's explain why these results are reasonable. In this case, two true mean 
functions cross over at time t = f3, the differences before this time point 



Table 1 





Estimated 


size and \ 


power 


of the propo 


'sed test for 


• Poisson 


processes 


in Case 1 


















Tpsz and Tz 




































711 


= = 50 












0.0 


0.060 


0.058 




0.058 


0.056 


0.063 


0.061 




0.061 


0.063 


0.061 


0.1 


0.298 


0.210 




0.209 


0.194 


0.214 


0.200 




0.200 


0.204 


0.207 


0.2 


0.858 


0.747 




0.748 


0.790 


0.697 


0.667 




0.665 


0.695 


0.693 


0.3 


1.000 


0.987 




0.983 


0.986 


0.981 


0.974 




0.974 


0.968 


0.979 












ni : 


= n2 = 100 












0.0 


0.047 


0.047 




0.047 


0.049 


0.044 


0.046 




0.046 


0.045 


0.043 


0.1 


0.542 


0.472 




0.471 


0.489 


0.423 


0.405 




0.405 


0.411 


0.422 


0.2 


0.993 


0.967 




0.964 


0.991 


0.958 


0.948 




0.947 


0.952 


0.950 


0.3 


1.000 


1.000 




1.000 


1.000 


1.000 


1.000 




1.000 


1.000 


1.000 








Ti 










Ti 












ni = 


n2 


= 50 






ni = 




= 100 






0.0 


0.052 


0.051 




0.051 


0.053 


0.051 


0.049 




0.049 


0.050 




0.1 


0.340 


0.218 




0.218 


0.220 


0.548 


0.479 




0.474 


0.492 




0.2 


0.868 


0.787 




0.764 


0.798 


0.996 


0.976 




0.974 


0.993 




0.3 


1.000 


0.997 




0.995 


0.991 


1.000 


1.000 




1.000 


1.000 
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Table 2 

Estimated size and power of the proposed test for mixed Poisson processes in Case 1 



T2 Tpsz and Tz 



f3 
































n-L 


= = 50 










0.0 


0.043 


0.040 




0.042 


0.045 


0.037 


0.040 


0.040 


0.042 


0.035 


0.1 


0.100 


0.097 




0.097 


0.099 


0.084 


0.085 


0.085 


0.086 


0.083 


0.2 


0.221 


0.205 




0.207 


0.204 


0.185 


0.184 


0.184 


0.180 


0.183 


0.3 


0.458 


0.407 




0.408 


0.415 


0.380 


0.375 


0.375 


0.379 


0.370 












711 ■- 


= n2 = 100 










0.0 


0.043 


0.041 




0.041 


0.046 


0.048 


0.045 


0.045 


0.044 


0.046 


0.1 


0.140 


0.125 




0.125 


0.138 


0.114 


0.111 


0.111 


0.112 


0.111 


0.2 


0.410 


0.364 




0.362 


0.368 


0.317 


0.307 


0.307 


0.314 


0.316 


0.3 


0.708 


0.663 




0.662 


0.672 


0.596 


0.592 


0.592 


0.593 


0.590 








Ti 








Ti 












ni = 


n2 


= 50 






ni = n2 


= 100 






0.0 


0.054 


0.048 




0.048 


0.046 


0.048 


0.047 


0.047 


0.051 




0.1 


0.108 


0.102 




0.102 


0.100 


0.142 


0.126 


0.123 


0.137 




0.2 


0.216 


0.205 




0.206 


0.207 


0.412 


0.388 


0.390 


0.391 




0.3 


0.474 


0.404 




0.402 


0.437 


0.710 


0.672 


0.670 


0.671 





Table 3 

Estimated power of the proposed test for Poisson processes in Case 2 



T2 Tpsz and Tz 





















TsF 










Til 


= 712 = 50 










3 


1.000 


0.787 


0.766 


1.000 


0.956 


0.900 


0.899 


1.000 


0.955 


5 


0.969 


0.080 


0.077 


1.000 


0.189 


0.113 


0.111 


0.880 


0.188 


8 


0.127 


0.674 


0.688 


0.993 


0.403 


0.559 


0.562 


0.069 


0.400 










Til : 


= 712 = 100 










3 


1.000 


0.964 


0.960 


1.000 


0.999 


0.993 


0.993 


1.000 


0.999 


5 


1.000 


0.078 


0.079 


1.000 


0.290 


0.140 


0.139 


0.988 


0.284 


8 


0.222 


0.935 


0.939 


1.000 


0.670 


0.843 


0.846 


0.082 


0.667 






Ti 




















ni = 712 


= 50 






ni = 712 : 


= 100 






3 


1.000 


0.784 


0.769 


1.000 


1.000 


0.958 


0.955 


1.000 




5 


0.969 


0.088 


0.086 


1.000 


1.000 


0.084 


0.083 


1.000 




8 


0.130 


0.675 


0.689 


0.995 


0.232 


0.932 


0.935 


1.000 





and after this time point have different signs; the difference after this point 
seems to dominate the difference before this point for the cases of /? = 3, 5 
and seems to be dominated by the difference before this point for the case 
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Table 4 

Estimated power of the proposed test for mixed Poisson processes in Case 2 



T2 Tpsz and Tz 



































711 


= 772 = 50 










3 


0.858 


0.301 




0.294 


0.992 


0.386 


0.321 


0.318 


0.708 


0.380 


5 


0.424 


0.078 




0.078 


0.943 


0.089 


0.071 


0.071 


0.289 


0.086 


8 


0.062 


0.255 




0.263 


0.771 


0.117 


0.158 


0.158 


0.039 


0.111 












771 - 


= 712 = 100 










3 


0.992 


0.534 




0.530 


1.000 


0.695 


0.594 


0.594 


0.949 


0.691 


5 


0.677 


0.071 




0.072 


1.000 


0.100 


0.065 


0.065 


0.473 


0.095 


8 


0.096 


0.434 




0.437 


0.961 


0.185 


0.280 


0.280 


0.067 


0.182 








Ti 








Ti 












Til = 




= 50 






ni = 7i2 ■ 


= 100 






3 


0.858 


0.299 




0.289 


0.991 


0.993 


0.533 


0.529 


1.000 




5 


0.396 


0.074 




0.074 


0.942 


0.685 


0.068 


0.066 


1.000 




8 


0.063 


0.259 




0.268 


0.771 


0.094 


0.432 


0.438 


0.960 





of 13 = 8. When /3 = 3,5, the tests with wi^^ and VF^^^ have better powers 
than those with wj^^ and wl^\ and the test with wji^^ has the largest 
power since it weights the difference at later times more than those with 
wjl'\ Wn^^ and Wn^\ In particular, when (3 = 5, the powers of the tests 

(2) (3) 

with WA ' and W^ ' are very poor. This is because the small difference with 
large weights before this point and the large difference with small weights 
after this point seem to cancel each other. When /? = 8, the tests with Wn , 
Wn^^ and wi^^ perform better than the test with Wn^\ When /? = 8, the 

biggest difference between two mean functions occurs at an earlier time, so 

(2) (3) 

that the tests with Wn and Wn have reasonable powers. But the test 
with Wi^^ = 1 has a poor power though the difference at earlier times seems 
to dominate the difference at later times. This can be understood from the 
expressions of the test statistics U„ and V^, where the differences with 
different signs multiplied by the value of the mean function may cancel each 
other, since the mean function takes small values at earlier times and large 

values at later times. When (3 = 8, the test with Wn^^ still perform well. This 
is because it puts zero weight at earlier times and heavier weight at later 
times. Similar situations happened in real examples considered in the next 
section. 

To evaluate the asymptotic result given in Theorem 3.1, the quantile 
plots of the test statistic T2 against the standard normal distribution are 
constructed. Figures 3 and 4 present the plots for the cases with Wn{t) = 

Wn^\t) and n = 100 and n = 200, respectively, and they clearly reveal that 
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the asymptotic approximation is very good. Similar plots were obtained for 
test statistic Ti and other situations as well. 

In the above simulation study, we did examine all four weight processes 
suggested earlier in Section 3; in Case 1, the weight process Wi^^ yielded 
slightly higher power than the other three weight processes, and in Case 2, 
the weight process wi^^ yielded the largest power. These simulation results 
suggest that, when the mean functions do not cross over, the test with the 
equal weight has a good power; otherwise, the test with the unequal and 
appropriate weight will also have a good power. In general, one can choose 
the weight process based on the behavior of the NPMLEs of the mean func- 
tions to improve power, since the true mean functions are unknown. When 
the difference of mean functions at earlier times dominate the difference at 

(2) (3) 

later times, the tests with Wn and Wn tend to have good powers; when 
the difference of mean functions at later times dominate the difference at 
earlier times, the test with Wjl^"* tends to have a good power. In addition 
to the four processes considered here, some other weight processes can be 
found in Andersen et al. (1993), which discusses nonparametric treatment 
comparison based on recurrent event data. It would, therefore, be of great 
interest to investigate the problem of the selection of a weight process based 
on data. 

^to^mal Q-Q Plot 



to 
a 




-10 12 3 

Theoretical Quantiiss 

Fig. 3. Simulation study. Normal quantile plot for T2 (n = 100). 
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The new tests based on the NPMLE are more powerful and more robust 
than the existing tests based on the NPMPLE. One possible reason is that 
the NPMLE is more efficient than the NPMPLE. The main drawback of 
the NPMPLE is that the dependence of events within a subject is ignored. 
Another reason is that the structure of new test statistic is more reasonable, 
since it is based on the characteristic of the NPMLE. 

5. Illustrative examples. To illustrate the proposed method, we consider 
here two examples: a floating gallstones study and a bladder tumor study. 

5.1. A floating gallstones study. Thall and Lachin (1988) described a 
follow-up study on patients with floating gallstones. The data consist of the 
first year follow-up of the patients in two study groups, placebo (48) and 
high-dose chenodiol (65), from the National Cooperative Gallstone Study. 
The observed data include the successive visit times in study weeks and the 
associated counts of episodes of nausea for patients in different treatment 
groups [see Table 1 of Thall and Lachin (1988)]. The whole study consists of 
916 patients who were randomized to placebo, low dose or high dose group 
and followed for up to two years. During the study, patients were scheduled 
to return for clinical visits at 1, 2, 3, 6, 9 and 12 months. In reality, most 
of the patients visited about six times within the first year, but actual visit 
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Fig. 5. Floating gallstone study. Estimates of the mean functions. 



times differ from patient to patient. Some patients had only one visit and 
some had 9 visits. As pointed out by Thall and Lachin (1988), there is no 
evidence that the number of observations and actual observation times are 
related to the incidence of nausea, and so it seems reasonable to assume 
that conditions required for the asymptotic results hold in this case. The 
problem of interest here is to compare the two treatment groups in terms of 
the incidence rates of nausea. 

To test the difference between the two groups, we treated the placebo 
group as Group 1 (Ai(t)) and the high-dose chenodiol group as Group 2 
(A2(t)) and applied the proposed method to the data from 113 gallstone 
patients in the two groups to test the null hypothesis HQ:Ai{t) = A2{t). 
The nonparametric maximum likelihood estimators of the incidence rates 
of nausea and the increments of the estimators are shown in Figures 5 and 
6. We obtained Ti = 0.175 and T2 = 0.206 with Wn{t) = wi^\t), giving p- 
values of 0.861 and 0.837 based on the standard normal distribution, and 
Ti = -337.221, -494.571 and 241.159 and T2 = -193.238, -283.739 and 
138.311 with Wnit) = Wi^\t), wi^\t) and 1 - wi^\t), which correspond 
to p-values ^ 0.0001. The proposed tests with appropriate weights suggest 
that the incidence rates of nausea were significantly different for the patients 
in the two groups, and this agrees with the results given in Thall and Lachin 
(1988); the proposed unweighted test fails to reject Hq. This can be easily 
understood by looking at the behavior of increments of the estimators. From 
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Fig. 6. Floating gallstone study. Increments of the estimated mean functions. 

Figure 6, we can see clearly that the increment of the mean event rate in the 
placebo group is higher than that in the high dose group at earlier times and 
in contrast, the increment of the mean event rate in the high dose group is 
higher than that in the placebo group at later times in the year. So, the test 
with equal weights could not detect the difference between two groups. In 
comparison, the use of the approach in Sun and Fang (2003) gave a p- value 
of 0.1428; Park, Sun and Zhao (2007) gave p- values 0.454, 0.417 and 0.413 
with three weights, respectively; and the tests presented by Zhang (2006) 
would give the same result as above. Thus, none of the existing tests based 
on NPMPLE can detect the difference of two treatments, and the proposed 
tests with suitable weights have detected successfully that, as we expected. 
One possible reason for this is that the nonparametric maximum likelihood 
estimator is more efhcient than the nonparametric pseudo-likelihood esti- 
mator. 

5.2. A bladder tumor study. We consider a bladder tumor study con- 
ducted by the Veterans Administration Co-operative Urological Research 
Group (VACURG), and the data are presented in Andrews and Herzberg 
(1985). For some earlier analyses of these data, one may refer to Byar, 
Blackard and The VACURG (1977), Byar (1980), Wellner and Zhang (2000), 
Sun and Wei (2000), and Zhang (2002, 2006). The data were obtained from a 
randomized clinical trial. All patients had superficial bladder tumors when 
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they entered the trial, and they were assigned randomly to one of three 
treatments: placebo, thiotepa a pyridoxine. At subsequent follow-up vis- 
its, any tumors noticed were removed and treatment was continued. The 
study included 116 patients, of which there were 47 in placebo group, 38 
in thiotepa group and 31 in pyridoxine. We can get a set of panel count 
data {ki,tij,nij,j = 1, . . . , fcj, i = 1, . . . , n} where for the ith patient, ki is the 
number of visits, tjj's are all visit times and riij is total number of tumors 
until tij (j = 1, . . . , ki). The objective of the study is to determine the effect 
of treatment on the frequency of tumor recurrence. 

Let Ai(t),A2(t) and A3(t) be the mean functions corresponding to the 
three treatment groups: placebo, thiotepa and pyridoxine, respectively. The 
nonparametric maximum likelihood estimators of mean functions and their 
increments from the three groups are presented in Figures 7 and 8, respec- 
tively. We observe from Figure 7 that the difference of the three groups be- 
comes larger when the time increases. To test the null hypothesis Hq : Ai(t) = 
A2(t) = A3(t), we applied the proposed method to this panel count data. 
We obtained Xo = 3.617,3.269 and p-value = 0.164,0.195 with Wnit) = 1, 
Xo = 1196123,300179.2 and p- values < 10"^ with Wn{t) = y„(i), and Xo = 
489000.4,121908.1 and p-values < 10"^ with Wn{t) = 1 - Yn{t), based on 
the asymptotic distributions for test statistics U„ and V„ given in Theo- 
rem 3.1, respectively. The proposed tests having weights suggest that the 
frequency of tumor recurrence are significantly different for the patients in 
the three groups at 0.01 level of significance, while the proposed unweighted 
test fails to detect the difference. This can also be understood from the be- 
havior of the increments of the estimated mean functions shown in Figure 8. 
Incidentally, through a regression analysis of the data from two treatments, 
placebo and thiotepa, Sun and Wei (2000) and Zhang (2002) concluded that 
thiotepa effectively reduces the recurrence of tumors. However, the existing 
test procedures [Sun and Fang (2003), Zhang (2006)] based on NPMPLE 
fail to reject the null hypothesis at level 0.05. 

These examples illustrate that different weights may result in different 
conclusions, and the tests with appropriate weight process could lead to 
better power of the test. Therefore, the selection of a suitable weight process 
would be important for detecting difference between groups. 

6. Proofs. In this section we present the proofs of Theorems 2.1 and 3.1. 

6.1. Proof of Theorem 2.1. We begin with some preliminary results. For 
convenience, let us first recall some notation given in Wellner and Zhang 
(2000). Set 

JT = {A : [0, r] [0, oo) | A is nondecr easing, A(0) = 0}. 
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Fig. 7. Bladder tumor study. Estimates of the mean functions. 
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Fig. 8. Bladder tumor study. Increments of the estimated mean functions. 
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Let ti < t2 < • • • < denote the ordered distinct observation time points 
in the set of all observation time points {Tft-. j = 1, . . . , Ki,i = 1, . . . , n}. 
Also, let = {u = {ui,U2, • • • , Um) : < ui < • • • < Um < oo} and the map 
A:J~^^ be defined by 

u = ^(A) = (A(ti),A(t2),...,A(t™)) for all A G ^. 

We also define a rank function R: {Tj^.j : j = 1,2, ... ,Ki;i = 1,2, ... ,n} ^ 
{1, 2, . . . , m} such that 



if Txij = tg. 



Then, the log-likelihood function can be rewritten as 



(u|X)=^ 



Y,{N,{TK,j)-Ni{TK,^,_,)} 



1=1 lj=i 



and the NPMLE A„ of Aq is then given by 



Set 



where 



(A„(ti), A(t2), . ..,Kn{tm)) = Un = argmax(/)(u|X). 

u€£2 



-^Ku) = ^^^^ = E ^^M) for £ = 1, 2, . . . , m. 



dui 



i=l 



+ 



\^{Tk 



Lemma 1. Let ip he any real function. Then, 

m ( n ^ 

(6.1) E'^(^^) E<^^/(") =0- 

1=1 U=i J 
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Proof. Let aj = A„(tj) — A„(tj_i), j = 1, . . . ,m. Using arguments sim- 
ilar to Proposition 2.1 of Groenebom (1996), we have 

y:^=0 ifa.>Oor^ = l. 

^ OUj 

Let ifc^ < tfcj < • • • < tfcp be jump points of A„. Then, 

m n 

^^^i,^(u) = 0, j = l,...,p. 



and so 



Thus, 



kj<t<kjj^\ i=l 



'P(.ue)Yct>i,e{u) = 0, j = l,...,p-l, 

kj<(.<kjj^\ 1=1 

since U£ = A{t£) is a constant for kj < i < /c_,+i. Therefore, we conclude 
that 

m n 

t=i 1=1 
Hence, the lemma follows. □ 

Now, let fii be as defined in Section 2, and let di be the L2{iJ,i) metric on 
J^, i = l,2. Then, for Ai,A2 GJ^, 



(6.2) 



df(Ai,A2)= / \Ai{t) - A2{t)\' dfiiit) 

K 



E 



■i=i 



and 



di(Ai,A2) 



(6.3) = IJ\iAi{s)-Aiit))-iA2{s)-A2it))fdf,2{s,t) 



E 



K 



5]{(Ai(rK,,) - Ai(r^,,„i)) - (A2(rK,,) - A2(r^,,_i))}^ 



24 N. BALAKRISHNAN AND X. ZHAO 

If < Kq) = 1 for some constant Kq, then we have 

(6.4) (Ai , A2) < di ( Ai , As) < Kod2 (Ai , A2) . 
Wellner and Zhang (2000) showed that 

(6.5) di(A„,Ao)^0, 

and hence that the uniform consistency of A„ can be shown by using argu- 
ments similar to Proposition 5 of Schick and Yu (2000) under Conditions A, 
B, D and E; that is, 

(6.6) sup |An(t)-Ao(t)| ^0. 

te[T0,T] 

Note that the uniform consistency of A„ imphes that for every < (5o < 
min{Lo/2, Ao(ro)} and any e > 0, there exists a positive integer such that 

(6.7) sup p( sup |A„(t)-Ao(t)| >(^o| <£■ 

n>7Ve U6[ro,r] J 

Here, we fix 60 . Let 

(6.8) .Fo = (a:Ag.F, sup |A(t)-Ao(t)|<<5o|. 

I te[ro,T] J 

Define A* as 

r n Ki ^ 

a; = argmax 5]5](AiV,(r^^,,)log(AA(r^,,,)) - AA{Tk^,)) , 

where Q is the class of nondecreasing step functions with possible jumps only 
at the observation time points {TK^j,j = 1, . . . , Ki, i = 1, . . . , n}. Clearly, we 
have 

(6.9) sup P(A„ / A;) < sup p( sup |A„(t) - Ao(t)| > 5o] < e. 
Lemma 2. We have fii(A* , Aq) = Op{n~^/^). 

Proof. To establish the rate of convergence for A* , we shall apply The- 
orem 3.2.5 of Van der Vaart and Wellner (1996). Define 

K 

mA(X) = Y^[{N{Tk,,) - NiTKj-i)) log{AiTKj) - A(rKj-i)} 

(6.10) 

-{A{Tk,)-A{Tkj-i)}] 
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and 

(6.11) M{A) = PmA{X). 

Let h{x) = x(log{x) — 1) + 1. Then, h{x) > ^{x — 1)^ for x in a neighborhood 
of X = 1. Thus, in a neighborhood of Aq, 

M(Ao) -M(A) 

~{ \ A(lK,j) - A[lK,j~l) 

1 ff{{Ao{u)-A{u))-{Ao{v)-Aiv))r . . 

afi2 [u,v) 



5JJ A{u)-A{v) 
>ci(i2(A,Ao) 

for some constant ci , and hence the separation condition of the theorem is 
satisfied. Also, let 

(6.12) J^s = {^-di{A,Ao)<5,AGj^o} (^ > 0) 
and 

(6.13) Ms = {mA{X)-mA,iX):AeJ's}- 

Note that J-s is a class of monotone nondecreasing functions. Then, it follows 
from Theorem 2.7.5 of Van der Vaart and Wellner (1996) that for any rj > 0, 
there exists a set of brackets {[Af', A^] :i = 1, . . . , J}, where J < e^^/'' for 
some constant C2 and di{A^ , Af) < rj such that for any A S .F^, A^ < A < A^ 
for some i with 1 < z < J. Note that Af , Af (i = 1, . . . , J) may not belong to 
J- 5, and so they may not have a uniform positive lower bound and a uniform 
finite upper bound in [ro,r]. Also note that for any A € .Fq, we have from 
Conditions A, B and C that 

< Ao(ro) -6q< Ao(t) -6q< A{t) < Ao{t) +6o< Ao(t) +6o< M + 6o 

for t £ [ro, r] and 

0<Lo-2So<AAo{Tkj)-26o 
< AA{TK,j) < AAoiTKj) + 2So < 2M + 2So 
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for j = 1, . . . , K with probability 1. Hence, for Ms, we can construct a set 
of brackets {[Af/^(X), M/^(X)] : i = 1, . . . , J} as follows: 

K 

MHx)=Y.[ANiTKj) 
i=i 

X log{max(Af'(ri^j) - Af (r^,,_i), AAo(Ti^j) - 26o)} 

-{Af{TK,j)-Af{TK,-i)}] 

-?nA„(X) 

and 

Mf (X) =f;[AiV(r^,,)log{Af (T;,,,) - Af (r^,,„i)} 

- {AfiTKj) - Af (r^,,„i)}] - mA„(X). 

Set II • \\p^B be the Bernstein norm as defined in Van der Vaart and Wellner 
(1996) and iV[.] the braking number for the class Ms- Then, it follows from 
Condition D that 

||Mf (X) - MHx)fp,B < C3d?(Af , Af ) < C3??^ i = l,...,J 

for some constant C3 and for any A G and 

||mA(X) - mA,{X)\\ls < C4d?(A, Ao) < 046^ 

for some constant C4. So, 

log N[.]{r], Ms, II • ||p,b) < C5?7"^ 

for some constant C5. Hence, by applying Lemma 3.4.3 of Van der Vaart and 
Wellner (1996), we have 

E*\\V^{P^-P)\\j^^<ceMS) 

for some constant cq, where E* denotes the outer expectation, Pn is the 
empirical measure corresponding to X, Pnf = J27=i fiXi)/n and (pni^) = 
5^/^ + 6~^n~^/'^. Now, upon using Theorem 3.2.5 of Van der Vaart and 
Wellner (1996), di(A*, Aq) converges in probability to zero of order at least 
^-1/3 _ This completes the proof of the lemma. □ 

Now we turn to the proof of Theorem 2.1. First, note that 

(6.14) VnPf^ {X) = -hn + hn + /3n, 
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Iln = V^iPn-P)hJX), 
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hn = VnPn 



K-l 



+ WiTK,K)AniTK,K)\l 



and 



rK-i 



-.7=1 



^N{Tk,k) 

AAn{TK,K) 



r AAo{TK,j+i) - AN{Tkj+i) 



+ W{Tk,k)K{Tk,k 



AAn{TK,j+l) 

AAo{TK,j)-AN{TK,j) 

AAn{TK,j) 

.AN{Tk,k)-AAq{Tk,k) 



Let 



K-l 



+ W{Tk,k)HTk,k, 



AAn{TK,K) 

AAojTKj+i) - AN{TK,j+i) 
AA{TK,j+i) 

AAo{Tkj)-AN{Tk,j) 
AA{Tkj) 

AN{Tk,k) - AAo{Tk,k) 



Note that 



where 



AA{Tk,k) 

hn = Vn{Pn - P)gk^ {X) = hn + hn, 
hn = MPn - P){gkS^) - 9aM)} 



and 



'5n 



V^{Pn-P)gA,{x). 



It is easy to see that hn is a ?7 -statistic and has an asymptotic normal distri- 
bution with mean zero and variance cr^ that can be consistently estimated 
by (T^ as given in the statement of the theorem. Hence, it is sufficient to 
show that hni hn and hn all converge in probability to zero. 
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We will show the convergence of Iin first. Let denote the version of /i„ 
obtained by replacing A„ with A* . Then, to prove that Iin converges to zero 
in probability, it is sufficient to show that JJ'^ = Op(l), since f {A„ 7^ A* } < e. 
Let 

J^i={fA{X):AeTo}. 

Also, let {[Af , A^] : I = 1, . . . , J} be a set of r^-brackets for covering Tq with 
J < gc/r? some constant c by Theorem 2.7.5 of Van der Vaart and Wellner 
(1996). Then, for J^i, we can construct a set of brackets {[//'(X), //^(X)] : i = 
1, . . . , J} as follows: 



+ WiTK,K) 



Af{TK,,)AAo{TK,j+i) 

A^{TK,j)AAo{TK,^ 



max{Af (Ti^j) - Af (T^j^i), AAo(Ta'j) - 26o} 



K 



A^{Tk,k)Aq{Tk,k) 



max{Af (Tx.k) - Af-{TK,K-i). AAq{Tk,k) - 25o} 



and 



Af{TK,,)AA,{TK,j+i) 



max{Af (Ta'j+i) - Af (rx,^), AAo(Tx,j+i) - 25o} 



Af {Tkj) AAojTj 



+ W{Tk,k) 
It can be shown that 



Af(Tx,,)-Af(rK,,_i) 

Af{TK,K)MTK,K) 



Af{TK,K)-^HTK,K-l) 



P{fr{X)-ft{X)V<cidi{Ar,At) 

for some constant ci and for any A G JFq, Pf^{X) < C2(if(A, Aq) for some 
constant C2. Hence, .Fi is a P-Donsker class, and it follows from Lemma 2 
and Corollary 2.3.12 of Van der Vaart and Wellner (1996) that = Op(l). 
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Next, we show the convergence of l2n- Set Wq = W o Kq^ . Then, from 
Lemma 1, we can rewrite as 



'2n 



K-l 



{Wo{Ko{TK,j)) - Wo{kn{TK,j))y^n{TK,j) 



+ {Wo{Ao{Tk,k)) - WoiK{TK,K))}K{TK,K 

AN{Tk,k) 

AAn{TK,K) 



X 1 



+ A2n, 



where 



Ai„ = - P) 



K-l 



{WoiAo{TK,j)) - WoiK{TKM^n{TK,j) 

lj=l 



AA„(rA'j+i) AA„(Tkj 
+ {Wo{Ao{Tk,k)) - Wo{K{Tk,k))}K{Tk,k 

,,,, ^N{Tk,k) 

AAn{TK,K) 



and 



^2n 



K-l 



J2 {Wo{Ao{Tk,,)) - Wo{K{TKM^n{TK,j) 

/ AN{TK,j+i) AN{TK,j) \ 



X 1 



\ AAniTK,j+i) AA„,(Ta j) j 

+ {Wo{Ao{Tk,k)) - WoiAn{TK,KmAn{TK,K) 

AAn{TK,K)- 

Let A*„ and Ag^ denote the versions of Ai,j and A2n obtained by replac- 
ing An with A* , respectively. Set 

K-l 

hA{X) = Y {Wo{Ao{Tk,j)) - WoiAiTKMHTK,j) 
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I AA(Ta-,,+i) AA{TK,j) J 
+ {Wo{Ao{Tk,k)) - Wo{A{Tk,k))}MTk,k) 

f ANiTK,K) ] 
I AAiTK,K) J 

and 

T2 = {hA{X):A€To}. 

Note that the uniform covering entropy for JFq is bounded by c/rj for some 
constant c from Theorem 2.7.5 of Van der Vaart and Wehner (1996). Since 
Wq is a bounded Lipschitz function, it can be shown that for Ai, A2 G 

PHHaAX) - hA,{X)f} < C3d?(Ai, A2) 

for some constant C3 and for any A G .Fq, 

P{hl{X))<c^dj{A,Ao) 

for some constant C4. Hence, the uniform entropy for J^2 is bounded by c/r], 
and then J^2 is a P-Donsker class from Theorem 2.5.2 of Van der Vaart and 
Wellner (1996). Since di(A*,Ao) 0, it follows from the uniform asymp- 
totic equicontinuity of the empirical process [Van der Vaart and Wellner 
(1996), pages 168-171] that A*„ = Op(l). Then, we have Ai„ = Op(l), since 

For A|„, since Wq is a bounded Lipschitz function, it follows that 

l^2n 

\<C5V^dl{Al,Ao), 

where C5 is a constant. This shows, from Lemma 2 and P(A„ 7^ A*) < e, 
that A2n = Op{l). 

For /4„, we let /|„ denote the version of I^n obtained by replacing A„ 
with A* , and let 

:F3 = {gA{X)-gA,{X):AeTo}. 

We can use the same techniques as those used for proving the convergence 
of Iin to show that JP3 is P-Donsker and P{gA{X) — gA^iX)}'^ < cedi{A, Aq) 
for some constant cq, and hence /|^ = Op(l), which completes the proof of 
the theorem. 

6.2. Proof of Theorem 3.1. (i) To obtain the asymptotic distribution of 
Un, we first note that uli' can rewritten as 

(6.15) Uli^ = f/S - + U^l + U!l + USI + U^l 

V ni 
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where, for I = 1, . . . ,k, 



U- 



(0 

In 



+ W{TK,K)An{TK,K)\ 1 



AAo(T, 



K,K) 



AAn{TK,K) J . 



(0 



2n 



■ i=i 



'^lAA„,(r^,,+i) AA„,(r,^j)J 



+ W{TK,K)An,iTK,K)\l 



AAo{Tk,k) 



K-l 



Y,Wii\TK,,)K{TK,j) 

AA„,(rj^,,+i) AA„,(T;,,,) | 

Akn{TK,j+l) Akn{TK,j) i 



+ W{TK,K)An{TK,K) X 1 



AA„,(r;^,j^) 

AA„(ri^,i^) 



u, 



(0 



4n 



V^p 



K-l 



lj=l 



I AA„(Txj+i) AA„ (Ti^ J ) J 

+ - W{TK,K)yAn{TK,K) 

AK,{Tk,k. 

AAn{TK,K) 



X 1 



(0 

5n 



x-1 



J2 W{TK,){K{TK,j)-KiTK,,)} 

^ f AA„,(r^j+i)-AAo(rj^,,-+i) 

I AA„(rKj+i) 

AA„,(rj^j)-AAo(rKj) 
AA„(rxj) 
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AA„,(rx,/c)- AAo(rAv<) 



+ W{Tk,k)K{Tk,k: 



and 



■^-1 . r . 

^ w^(r^,,)A„,(r^,,) (AA„,(r^,,+i) - AAo(r^j+i)) 



1 



AA„(Ti;cj+i) AA„;(Tft'j-|.i^ 

(AA„,(r^j)-AAo(ri^j)) 
1 1 



.AA„(r/<j) AA„,(r/^j) 

- M^(T^,i^)A„,(rx,A^){AA„,(rx,^) - AKq{Tk,k)} 
From the proof of Theorems 2.1, we have, for / = 1, . . . , fc. 



c/S = y„ + op(i) 



and 



where 



VK-l 



n - Vn[Fn 2^ W{lK,j)Ao{lK,j)< . ■ T - . . ^ 

IAAo(Ja-j+i) AAo(JK,j^ 



+ T^(TV,i^)Ao(7V,/r) 1 



and 



^ VF(rKj)Ao(r^,, 



ANiTK,K) 
AAo{Tk,k) 



^ f AAf(rx,,-+i) AiV(rA-,,) 



+ WiTK,K)MTK,K)\l 



AN{T, 



K,K 



AAo{Tk,k) 

where Pmf = ^Z]je5; fi-^i) ^^'^ '^i denotes the set of indices for subjects 
in group I = 1, . . . ,k. Evidently, Yn^^s are independent and identically 
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distributed, and \fnY^ 
and Z„ 



k 

1=1 



WiYji^\ Set Zi'^ 



,Z^Jt^f. Then, 



^ V n " V " 



y(0 

n 1 



n T^(0 7 1 



and so 



where 



:r„Y„ = rY„ + op(i), 



and 



converges in distribution to Y„, , having a A;-dimensional normal distribution 
with mean vector and covariance matrix diag((Tf, . . . ,cr^), where af's are 
given in the statement of the theorem. Thus, we have Z„ converging in distri- 
bution to a random variable that has a normal distribution iV(0, Su^,), 
where is given in (3.4) of Theorem 3.1. 

Now, we need to show that , U^^ , U^''^ and Uq2 all converge in prob- 



ability to 0, / 



of U^^ , U^^ , C/52 and UqH obtained by replacing A„ with A* and with 



I, . . . ,k. Let U^n* ^ ^in*) ^5n* ^'^'^ ^6n* denote the version 



r(0 



A* , respectively. Then, to prove that ^/i^ , U^n ' ^5n ^'^d ^7g^^ all converge in 



r{0 



Til ' 



probability to 0, Z = 1, . . . , /c, it is sufficient to show that u!^^* , C/j^*, U^n* ^ 



and U^"^ all converge in probability to 0, / = 1, . , 
For ijS*, set 

g = {e:[0,r]^[0,6]}. 



,k. 



where b is the uniform upper bound of weight process Wn^ (l = 1, . . . ,k) 



+ ^{Tk,k)Ai{Tk,k] 



AA2{Tk,j+i) AA2(rxj) 
AAi{Tk,j+i) AAi{TK,j) 

^ AA2{Tk,k) 
AAi{Tk,k) 
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and, for C ^G, 

Note that it follows from Theorem 2.7.5 of Van der Vaart and Wellner (1996) 
that 

iV[.](7?,.F5,L2(P))<e^i/^ 

for some constant ci. Then, we have 

iV[.](r?,M/5(e),^2(P))<e2'=i/''. 

It can be easily shown that \^p\^^\2{^, X)\ < ^(X), where ip{X) = C2K, and 
-^V'Ai A2(^'^) — ^3^^^ where C2 and C3 are universal constants for ^. Thus, 

J[.] (5, ^5 (0 , L2 (P) ) = .^l + logiV[.](,?||^||p,2,^5(6,^2(P)) di] < c^6'/^ 

for some constant universal C4. Hence, from Theorem 2.14.2 of Van der Vaart 
and Wellner (1996), we have 

E*\ sup \V^{Pn-P)^lJA,Mi^,X)\] 

UAi,A2(e,X)GM>4(0 J 

< C5[J[.](<5,*5(e),i2(P)) + V^Pm > V^a{5)}], 
where C5 is a universal constant and 

a{5)=5\\iJ\\p,2/^Jl + logN[.]{6\\i;\\p,2,^s{0,L2{P))■ 
Then, it can be easily shown that 

limsup^*( sup \V^iPn-P)iPA,M^,X)\\<CG6'/^ 
UAi,A2(5.^)e'i'^(0 

for some universal constant cg. It follows from di(A„, A„J that 
limsup£;|V^(/'„ -P)^^, ^, iwP,X)\ 

Letting 5^0, we have 

hm E\V^{Pn-P)i;j,, (VF«,X)| =0, 

which yields t/^g^* = ^p(l)- 
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(I)* 

For U^jI , we note that 



\Ut\<C7 



K 



V^P\ Y.\Wi'\TKj-i) - W{Tkj-i) 
{j=i 



x\A:{TKj)-K^{TK,j)\j 

+ V^p[j2\WPiTK,j)-W{TK,)\ 
lj=i 

x|A:(r;,,,„i)-A;(TK,,_i)|| 

= c7(42+42+42) 

for some constant C7, where 

A?n = V^II - T^(n)||A:(^;) - Al^{v)\ df^^M 
<V^JJ \W^'\u) - W{u)\\Al{v)-Ao{v)\ dfi2{u,v) 
+ V^II \Wj!\u) - W{u)\\K^{v) - Ao{v)\ df,2iu,v), 

42 = [wn^d) - w{t)\\Ai{t) - A:,(t)i d^l,{t) 

< ^ r\wll\t)-wmK{t)-Ut)Wi{t) 

Jo 

+ r\Wi}\t) - W{t)\\Al{t) - AoWl d^,^{t) 







and 



42 = ^/^nM\v)-W{v)\\Al{u)-A*(u)\d^l2{u,v 



<V^JJ\W}!\v)-W{v)\\AUu)-Ao (u) \d^i2{u,v) 
+ V^II\W^\v)-W{v)\\Al^ (n) -AQ{u)\d^Ji2{u,v). 
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Using the Cauchy-Schwarz inequality, we have 

7ll){u) - W{u)\\K{v) - K,{v)\d^i2{u,v) 

1/2 



< 



csV^^j\wii\t)-W{t)fdi,i{t) 

1/2 

x|y^ (A;(t)-Ao(t)r(i/ii(t)- 

in probability, where cg is a constant, since 



1/2 



Similarly, we have 



u) - W{u)\\A*iv) - Ao{v)\ dfi2iu,v) = Op(l). 



Thus, A^l^ = Op{l). Similarly, we have = Op(l) and = Op(l). Hence, 
U^^* = Op{l),l = l,...,k. 

For C^s^* and C/g^*' '^^ ^^o^^ ^^^^ 



\Ut\<C9 



K 



V^P^J2\A:{TKj)-AoiTK,)\' 



K 



and 



+ V^P\J2\K^{TK,j)-Ao{TK,j)\' 
{j=i 

eg { V^dl ( A; , Ao ) + V^dl ( a;, , Ao ) } 



|i7(ri < c.oiV^djiAlAo) + V^d?(A;, Ao)} 



for some constants cg and ciq. Hence, U^^* ~ ^pi^) ^^'^ ^6n* ~ '^p(l)) ^ 
1, . . . , fc. Therefore, the proof of part (i) is complete. 



(h) We note that vi'^ = U^"''' - U^' , I = 2, . . . ,k, where U^"''' is defined 
as by replacing W,!^^ with Wn^ for I = 2, . . . ,k. Then, it follows from 
(i) that 



r(0 



(1,0 
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for I = 2, . . . ,k, and so 

+ Op(l) =HY„ + Op(l), 

where H„ and H are given in the theorem. This completes the proof of part 
(ii). 

(iii) To show that af — cr^ = Op(l) for I = 1, . . . ,k, we set 

A'-l 



f AN{TK,j+i) ANjTKj) 



+ aTK,K)AiTK,K){^l 



AN{Tk, 



K 



AA{Tk,k) 

Then, al = Pc/)'^ {W, Aq, X) and af = Pn(l)^{W^^\ X). Note that 
af -al = P„{</.2(Ty«, A„,X) - ^^{W^^\A^,X)) 

+ Pn{^^{W^a^ , Ao , X) - </>2 ( Ao , X)} 

+ (/'„-P)</)2(W,Ao,X). 
It can be easily shown that 

and 

{Pn-P)^''{W^.A^,X) = o,,{\). 
Since it follows from Condition C that 

|,^(T^i'),Ao,X)-</.(VF,Ao,X)| 
= |<A(W^«-VF,Ao,X)| 

K 

< h{l + N{TK,K)}Y.\Wi!HTK,j) - W{Tk,j)\ 
i=i 

with probability 1 for some constant bi and 

\^iWi'\Ao,X) + (l){W,Ao,X)\ = \4>{wP + W,Ao,X)\ 

<b2K{l+N{TK,K)} 

with probability 1 for some constant b2, then we have, from the Cauchy- 
Schwarz inequality. Conditions P{K < Kq} = 1 and D, and (3.6) 

E\^\wj!\Ao,X,) -4>\W, Ao,Xi)\ 



<b3E 



K,, 
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<h:,[E{l + Ni{TK,^K:)} 



4ll/2 



2-1 1/2 



e\Y.\W^\Tk.,)-W{Tk^,^) 



< 64 max 

l<i<n 



0, 



Ki 



E\Y^\Wii\TK^,)-W{TK^,) 



1/2 



where 63 and 64 are finite positive constants, which completes the proof of 
part (iii). 



Remark 3. The monotonicity assumption of the weight process re- 
quired by Zhang (2006) can be removed by using the same techniques as 
those used here. 
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